Whole genome Identity-by-Descent determination
Abstract
High-throughput single nucleotide polymorphism genotyping assays conveniently produce genotype data for genome-wide genetic linkage and association studies. For pedigree datasets, the unphased genotype data is used to infer the haplotypes for individuals, according to Mendelian inheritance rules. Linkage studies can then locate putative chromosomal regions based on the haplotype allele sharing among the pedigree members and their disease status. Most existing haplotyping programs require rather strict pedigree structures and return a single inferred solution for downstream analysis. In this research, we relax the pedigree structure to contain ungenotyped founders and present a cubic time whole genome haplotyping algorithm to minimize the number of zero-recombination haplotype blocks. With or without explicitly enumerating all the haplotyping solutions, the algorithm determines all distinct haplotype allele identity-by-descent (IBD) sharings among the pedigree members, in linear time in the total number of haplotyping solutions. Our algorithm is implemented as a computer program iBDD. Extensive simulation experiments using 2 sets of 16 pedigree structures from previous studies showed that, in general, there are trillions of haplotyping solutions, but only up to a few thousand distinct haplotype allele IBD sharings. iBDD is able to return all these sharings for downstream genome-wide linkage and association studies.
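The gap between the number of haplotyping solutions and the number of distinct IBD sharings can be illustrated with a toy case. The sketch below is not the iBDD algorithm: it assumes a single marker, a nuclear family with both parents genotyped, and brute-force enumeration, and the function name `consistent_ibd_sharings` is hypothetical. It enumerates every Mendelian-consistent inheritance assignment and collapses the assignments into distinct sibling IBD sharing patterns.

```python
from itertools import product


def consistent_ibd_sharings(father, mother, children):
    """Toy single-marker illustration (an assumption, not the paper's method).

    father, mother: tuples of two alleles; position 0/1 labels the two
        parental haplotype alleles.
    children: list of unphased genotypes, each a tuple of two alleles.

    Returns (number of Mendelian-consistent inheritance assignments,
             set of distinct IBD sharing patterns over all child pairs).
    """
    n = len(children)
    solutions = 0
    sharings = set()
    # Each child inherits paternal allele index p and maternal allele index m.
    for assignment in product(product((0, 1), repeat=2), repeat=n):
        # Keep only assignments that reproduce every child's unphased genotype.
        consistent = all(
            sorted((father[p], mother[m])) == sorted(child)
            for (p, m), child in zip(assignment, children)
        )
        if not consistent:
            continue
        solutions += 1
        # For each pair of children, count parental alleles shared IBD (0, 1 or 2).
        pattern = tuple(
            int(assignment[i][0] == assignment[j][0])
            + int(assignment[i][1] == assignment[j][1])
            for i in range(n)
            for j in range(i + 1, n)
        )
        sharings.add(pattern)
    return solutions, sharings


# Two heterozygous parents and two heterozygous children.
count, patterns = consistent_ibd_sharings((1, 2), (1, 2), [(1, 2), (1, 2)])
```

In this example four inheritance assignments are consistent with the genotypes, but they collapse into only two distinct sharing patterns: the siblings share either both parental alleles or neither. With many markers and larger pedigrees the same collapse is what keeps the set of distinct sharings small relative to the set of solutions.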
Similar resources
Genome-wide identity-by-descent sharing among CEPH siblings.
The concept of genetic identity-by-descent (IBD) has markedly advanced our understanding of the genetic similarity among relatives and triggered a number of developments in epidemiological genetics. However, no empirical measure of this relatedness throughout the whole human genome has yet been published. Analyzing highly polymorphic genetic variations from the Centre d'études du polymorphisme ...
Genomic mismatch scanning identifies human genomic DNA shared identical by descent.
Genomic mismatch scanning (GMS) is a high-throughput, high-resolution identity by descent mapping technique that enriches for genomic DNA fragments that are shared between related individuals. In GMS, DNA heteroduplexes are formed from restriction-digested genomic DNA fragments from two relatives. Mismatch-free DNA heteroduplexes, likely representing DNA shared identical by descent between the ...
PLINK: a tool set for whole-genome association and population-based linkage analyses.
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, ...
Identity by descent in the mapping of genetic traits
This report shows how the descent of genome from an ancestor to currently observed descendants results in identity by descent (IBD) in current individuals, and hence similarities in their DNA at genetic marker loci. Conversely, data on the marker genotypes of individuals provides inferences of shared descent of genome in current individuals, not just genome-wide, but in specific genome regions....
Leveraging Identity-by-Descent for Accurate Genotype Inference in Family Sequencing Data
Sequencing family DNA samples provides an attractive alternative to population based designs to identify rare variants associated with human disease due to the enrichment of causal variants in pedigrees. Previous studies showed that genotype calling accuracy can be improved by modeling family relatedness compared to standard calling algorithms. Current family-based variant calling methods use s...
Journal title:
- Journal of Bioinformatics and Computational Biology
Volume 11, Issue 2
Pages -
Publication date: 2013